Prediction of Protein Function by Discriminant Analysis*

نویسندگان

  • PETR KLEIN
  • JOHN A. JACQUEZ
چکیده

Approximately 53% of the protein sequences in the National Biomedical Research Foundation (NBRF) database can be allocated to one of 26 functional classes, each of which can be characterized by the joint occurrence of four or fewer attributes. The attributes reflect collective physicochemical properties of the sequences in a class, ranging from simple characteristics of composition, such as average hydrophobicity and net charge, to amphipathicity and the propensities of various residues to be in certain preferred configurations. In some, though not all instances, these variables can be related in a general way to topological or other structural features of the particular class they characterize. We show that the attributes permit 17 of the 26 groups to be filtered from all other proteins in the database with a misclassification error of less than 2%, and that the remaining 9 groups can be filtered with errors not exceeding 13%. Thus for a given functional class, the results point to the existence of relatively few characteristic variables which capture most of the intraclass similarity and interclass variability that is common and peculiar to members of that class.

منابع مشابه

The Prediction Dependency on Virtual Social Networks Based on Alexithymia, Attachment Styles, Well-Being Psychological and Loneliness

Introduction: Virtual social networks like others type of addiction can be affected by psychological, developmental, and emotional problems. So, the aim of this research is to The purpose of this study was to investigate prediction dependency on virtual social networks based on alexithymia, attachment styles, well-being psychological and loneliness. The research design was a two-group diagnosti...

متن کامل

Prediction of unwanted pregnancies using logistic regression, probit regression and discriminant analysis

  Background: Unwanted pregnancy not intended by at least one of the parents has undesirable consequences for the family and the society. In the present study, three classification models were used and compared to predict unwanted pregnancies in an urban population.   Methods : In this cross-sectional study, 887 pregnant mothers referring to health centers in Khorramabad, Iran, in 2012 were ...

متن کامل

A prediction distribution of atmospheric pollutants using support vector machines, discriminant analysis and mapping tools (Case study: Tunisia)

Monitoring and controlling air quality parameters form an important subject of atmospheric and environmental research today due to the health impacts caused by the different pollutants present in the urban areas. The support vector machine (SVM), as a supervised learning analysis method, is considered an effective statistical tool for the prediction and analysis of air quality. The work present...

متن کامل

A prediction distribution of atmospheric pollutants using support vector machines, discriminant analysis and mapping tools (Case study: Tunisia)

Monitoring and controlling air quality parameters form an important subject of atmospheric and environmental research today due to the health impacts caused by the different pollutants present in the urban areas. The support vector machine (SVM), as a supervised learning analysis method, is considered an effective statistical tool for the prediction and analysis of air quality. The work present...

متن کامل

Pinpointing the classifiers of English language writing ability: A discriminant function analysis approach

The  major  aim  of  this  paper  was  to  investigate  the  validity  of  language  and intelligence  factors  for  classifying  Iranian  English  learners`  writing  performance. Iranian  participants  of  the  study  took  three  tests  for  grammar,  breadth,  and  depth  of vocabulary, and two tests for verbal and narrative intelligence. They also produced a corpus  of  argumentative  writ...

متن کامل

Protein Secondary Structure Prediction: a Literature Review with Focus on Machine Learning Approaches

DNA sequence, containing all genetic traits is not a functional entity. Instead, it transfers to protein sequences by transcription and translation processes. This protein sequence takes on a 3D structure later, which is a functional unit and can manage biological interactions using the information encoded in DNA. Every life process one can figure is undertaken by proteins with specific functio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

متن کامل
عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001